A Memory-Efficient Algorithm for Large-Scale Symmetric Tridiagonal Eigenvalue Problem on Multi-GPU Systems
نویسندگان
چکیده
Divide-and-conquer algorithm is a numerically stable and efficient algorithm that computes the eigenvalues and eigenvectors of a symmetric tridiagonal matrix. We often face the situation where the input matrix fits into the main memory but not into the on-chip memory of a GPU device. We present an out-of-core implementation where only part of the input matrix is resident in GPU memory at any point in time. It works independently of the physical size of GPU memory, handling any size of input as long as it fits into the main memory. Work is dynamically allocated to multiple GPUs and CPU cores, taking account of available workspaces and progress of the algorithm. In addition, it delivers a performance comparable to that of conventional multi-GPU implementations for cases where workspaces fit into the GPU memory.
منابع مشابه
MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures
The computation of eigenvalues and eigenvectors of symmetric tridiagonal matrices arises frequently in applications; often as one of the steps in the solution of Hermitian and symmetric eigenproblems. While several accurate and efficient methods for the triadiagonal eigenproblem exist, their corresponding implementations usually target uni-processors or large distributed memory systems. Our new...
متن کاملAccelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
We present a Hessenberg reduction (HR) algorithm for hybrid systems of homogeneous multicore with GPU accelerators that can exceed 25× the performance of the corresponding LAPACK algorithm running on current homogeneous multicores. This enormous acceleration is due to proper matching of algorithmic requirements to architectural strengths of the system’s hybrid components. The results described ...
متن کاملMAGMA: A Breakthrough in Solvers for Eigenvalue Problems
The Matrix Algebra on GPU and Multicore Architectures (MAGMA) project aims to develop the next generation of LAPACK/ScaLAPACK-compliant linear algebra libraries for heterogeneous multicore-GPU architectures. The functionality covered in LAPACK and ScaLAPACK is a fundamental building block for many scientific computing applications, underlining the importance and potential for broad impact in de...
متن کاملStatic versus dynamic heterogeneous parallel schemes to solve the symmetric tridiagonal eigenvalue problem
Computation of the eigenvalues of a symmetric tridiagonal matrix is a problem of great relevance. Many linear algebra libraries provide subroutines for solving it. But none of them is oriented to be executed in heterogeneous distributed memory multicomputers. In this work we focus on this kind of platforms. Two different load balancing schemes are presented and implemented. The experimental res...
متن کاملAn O(n Log 3 N) Algorithm for the Real Root and Symmetric Tridiagonal Eigenvalue Problems an O(n Log 3 N) Algorithm for the Real Root and Symmetric Tridiagonal Eigenvalue Problems
Given a univariate complex polynomial f(x) of degree n with rational coeecients expressed as a ratio of two integers < 2 m , the root problem is to nd all the roots of f(x) up to speciied precision 2 ?. In this paper we assume the arithmetic model for computation. We give an algorithm for the real root problem: where all the roots of the polynomial are real. Our real root algorithm has time cos...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014